# Visual Feature Extraction

Dinov2 Base ONNX
This is the ONNX format version of the facebook/dinov2-base model, suitable for computer vision tasks.
Transformers
D
onnx-community
19
0
Dinov2.giant.patch 14.reg 4
Apache-2.0
DINOv2 is a visual feature extraction model based on Vision Transformer (ViT), which enhances feature extraction capabilities by introducing register mechanisms.
D
refiners
17
0
C RADIO
Other
A visual feature extraction model developed by NVIDIA for generating image embeddings, supporting downstream tasks such as image classification.
Transformers
C
nvidia
398
14
Dinov2 Large
DINOv2 is a visual model released by Facebook Research that extracts general visual features through self-supervised learning, suitable for various downstream tasks.
Transformers
D
Xenova
82
1
Dpt Dinov2 Giant Kitti
Apache-2.0
DPT framework using DINOv2 as the backbone network for depth estimation tasks.
3D Vision Transformers
D
facebook
56
0
Dpt Dinov2 Large Kitti
Apache-2.0
This model employs the DPT framework with DINOv2 as the backbone network, focusing on depth estimation tasks.
3D Vision Transformers
D
facebook
26
2
Autotrain Ex And Pt 3122688388
This is a multi-category image classification model trained using AutoTrain, capable of recognizing various object categories.
Image Classification Transformers
A
Lloviant
17
0
Cvt 13
Apache-2.0
CvT-13 is a hybrid architecture model combining convolutional neural networks and vision transformers, pre-trained on the ImageNet-1k dataset, suitable for image classification tasks.
Image Classification Transformers
C
microsoft
21.80k
11
Regnet Y 032
Apache-2.0
RegNet image classification model trained on ImageNet-1k, featuring an efficient network structure designed through neural architecture search
Image Classification Transformers
R
facebook
21
0
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase